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Abstract 


This paper presents an algebraic theory of instruction sequences 
with instructions for Turing tapes as basic instructions, the behaviours 
produced by the instruction sequences concerned under execution, and 
the interaction between such behaviours and Turing tapes provided 
by an execution environment. This theory provides a setting for the 
development of theory in areas such as computability and computa- 
tional complexity that distinguishes itself by offering the possibility of 
equational reasoning and being more general than the setting provided 
by a known version of the Turing-machine model of computation. The 
theory is essentially an instantiation of a parameterized algebraic theory 
which is the basis of a line of research in which issues relating to a 
wide variety of subjects from computer science have been rigorously 
investigated thinking in terms of instruction sequences. 
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1 Introduction 


This paper introduces an algebraic theory that provides a setting for the 
development of theory in areas such as computability and computational 


This work is licensed under the Creative Commons Attribution-NoDerivatives 4.0 
International License 

‘Informatics Institute, Faculty of Science, University of Amsterdam, Science Park 904, 
1098 XH Amsterdam, the Netherlands, email: {J.A.Bergstra,C.A.Middelburg}@uva.nl. 


114 J.A. Bergstra, C.A. Middelburg 


complexity. The setting in question distinguishes itself by offering the pos- 
sibility of equational reasoning and being more general than the setting 
provided by a known version of the Turing-machine model of computation. 
Many known and unknown versions of the Turing-machine model of com- 
putation can be dealt with by imposing apposite restrictions. We expect 
that the generality is conducive to the investigation of novel issues in areas 
such as computability and computational complexity. This expectation is 
based on our experience with a comparable algebraic theory of instruction 
sequences, where instructions operate on Boolean registers, in previous work 
(see [7, 8, 9, 10, 11, 13]). 

It is often said that a program is an instruction sequence. If this 
characterization has any value, it must be the case that it is somehow easier 
to understand the concept of an instruction sequence than to understand 
the concept of a program. In tune with this, the first objective of the work 
on instruction sequences that started with [4], and of which an enumeration 
is available at [21], is to understand the concept of a program. The notion of 
an instruction sequence appears in the work in question as a mathematical 
abstraction for which the rationale is based on this objective. 


The structure of the mathematical abstraction at issue has been deter- 
mined in advance with the hope of applying it in diverse circumstances where 
in each case the fit may be less than perfect. Until now, the work in question 
has, among other things, yielded an approach to non-uniform computational 
complexity where instruction sequence length is used as complexity measure, 
a contribution to the conceptual analysis of the notion of an algorithm, 
and new insights into such diverse issues as the halting problem, program 
parallelization for the purpose of explicit multi-threading and virus detection. 


The basis of all the work in question (see [21]) is the combination of 
an algebraic theory of single-pass instruction sequences, called program 
algebra, and an algebraic theory of mathematical objects that represent 
the behaviours produced by instruction sequences under execution, called 
basic thread algebra, extended to deal with the interaction between such 
behaviours and components of an execution environment. This combination is 
parameterized by a set of basic instructions and a set of objects that represent 
the behaviours exhibited by the components of an execution environment. 


The current paper contains a simplified presentation of the instantiation 
of this combination in which all possible instructions to read out or alter the 
content of the cell of a Turing tape under the tape’s head and to optionally 
move the head in either direction by one cell are taken as basic instructions 
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and Turing tapes are taken as the components of an execution environment. 
The rationale for taking certain instructions as basic instructions is that the 
instructions concerned are sufficient to compute each function on bit strings. 
However, shorter instruction sequences may be possible if certain additional 
instructions are taken as basic instructions. That is why we opted for the 
most general instantiation. 

An instantiation in which instructions to read out or alter the content of 
a Boolean register are taken as basic instructions and Boolean registers are 
taken as the components of an execution environment turned out to be useful 
to rigorous investigations of issues relating to non-uniform computational 
complexity and algorithm efficiency (see e.g. [8, 10]). We expect that the 
instantiation presented in this paper can be useful to rigorous investigations 
relating to uniform computational complexity and algorithm efficiency. 

Program algebra and basic thread algebra were first presented in [4].? 
The extension of basic thread algebra referred to above, an extension to 
deal with the interaction between the behaviours produced by instruction 
sequences under execution and components of an execution environment, 
was first presented in [5]. The presentation of the extension is rather involved 
because it is parameterized and owing to this covers a generic set of basic 
instructions and a generic set of execution environment components. In the 
current paper, a much less involved presentation is obtained by covering 
only the case where the execution environment components are Turing tapes 
and the basic instructions are instructions to read out or alter the content 
of the cell of a Turing tape under the tape’s head and to optionally move 
the head in either direction by one cell. 

After the presentation in question, we make precise in the setting of the 
presented theory what it means that a given instruction sequence computes 
a given partial function on bit strings, introduce the notion of a single-tape 
Turing-machine program in the setting, give a result concerning the compu- 
tational power of such programs, and give a result relating the complexity 
class P to the functions that can be computed by such programs in polyno- 
mial time. We also give a simple example of a single-tape Turing-machine 
program. This example is only given to illustrate the close resemblance of 
such programs to transition functions of Turing machines. The notation that 
is used for Turing-machine programs is intended for theoretical purposes 
and not for actual programming. 


In that paper and the first subsequent papers, basic thread algebra was introduced 
under the name basic polarized process algebra. 
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This paper is organized as follows. First, we introduce program algebra 
(Section 2) and basic thread algebra (Section 3) and extend their combination 
to make precise which behaviours are produced by instruction sequences 
under execution (Section 4). Next, we present the instantiation of the 
resulting theory in which all possible instructions to read out or alter the 
content of the cell of a Turing tape under the tape’s head and to optionally 
move the head in either direction by one cell are taken as basic instructions 
(Section 5), introduce an algebraic theory of Turing-tape families (Section 6), 
and extend the combination of the theories presented in the two preceding 
sections to deal with the interaction between the behaviours of instruction 
sequences under execution and Turing tapes (Section 7). Then, we formalize 
in the setting of the resulting theory what it means that a given instruction 
sequence computes a given partial function on bit strings (Section 8) and give 
as an example an instruction sequence that computes the non-zeroness test 
function (Section 9). Finally, we make some concluding remarks (Section 10). 

In this paper, some familiarity with algebraic specification, computabil- 
ity, and computational complexity is assumed. The relevant notions are 
explained in many handbook chapters and textbooks, e.g. [16, 23, 27] for no- 
tions concerning algebraic specification and [2, 19, 24] for notions concerning 
computability and computational complexity. 

Sections 2-4 are largely shortened versions of Sections 2—4 of [12], which, 
in turn, draw from the preliminary sections of several earlier papers. 


2 Program Algebra 


In this section, we introduce PGA (ProGram Algebra). The starting-point 
of PGA is the perception of a program as a single-pass instruction sequence, 
i.e. a possibly infinite sequence of instructions of which each instruction is 
executed at most once and can be dropped after it has been executed or 
jumped over. The concepts underlying the primitives of program algebra 
are common in programming, but the particular form of the primitives is 
not common. The predominant concern in the design of PGA has been to 
achieve simple syntax and semantics, while maintaining the expressive power 
of arbitrary finite control. 

It is assumed that a fixed but arbitrary set A of basic instructions 
has been given. A is the basis for the set of instructions that may occur 
in the instruction sequences considered in PGA. The intuition is that the 
execution of a basic instruction may modify a state and must produce the 
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Boolean value 0 or 1 as reply at its completion. The actual reply may 
be state-dependent. In applications of PGA, the instructions taken as 
basic instructions vary from instructions relating to Boolean registers via 
instructions relating to Turing tapes to machine language instructions of 
actual computers. 

The set of instructions of which the instruction sequences considered in 
PGA are composed is the set that consists of the following elements: 


e for each a € A, a plain basic instruction a; 

e for each a € A, a positive test instruction +a; 
e for each a € A, a negative test instruction —a; 
e for each 1 € N, a forward jump instruction #1; 
e a termination instruction !. 


We write Z for this set. The elements from this set are called primitive 
instructions. 

Primitive instructions are the elements of the instruction sequences 
considered in PGA. On execution of such an instruction sequence, these 
primitive instructions have the following effects: 


e the effect of a positive test instruction +a is that basic instruction a is 
executed and execution proceeds with the next primitive instruction 
if 1 is produced and otherwise the next primitive instruction is skipped 
and execution proceeds with the primitive instruction following the 
skipped one — if there is no primitive instruction to proceed with, 
inaction occurs; 


e the effect of a negative test instruction —a is the same as the effect 
of +a, but with the role of the value produced reversed; 


e the effect of a plain basic instruction a is the same as the effect of +a, 
but execution always proceeds as if 1 is produced; 


e the effect of a forward jump instruction #/ is that execution proceeds 
with the /th next primitive instruction — if | equals 0 or there is no 
primitive instruction to proceed with, inaction occurs; 


e the effect of the termination instruction ! is that execution terminates. 
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Inaction occurs if no more basic instructions are executed, but execution 
does not terminate. 

PGA has one sort: the sort IS of instruction sequences. We make this 
sort explicit to anticipate the need for many-sortedness later on. To build 
terms of sort IS, PGA has the following constants and operators: 


e for each u € Z, the instruction constant u:—> IS; 
e the binary concatenation operator _;_:IS x IS— IS; 
e the unary repetition operator _”:IS > IS. 


Terms of sort IS are built as usual in the one-sorted case. We assume that 
there are infinitely many variables of sort IS, including X,Y, Z. We use infix 
notation for concatenation and postfix notation for repetition. 

A PGA term in which the repetition operator does not occur is called 
a repetition-free PGA term. 

One way of thinking about closed PGA terms is that they represent 
non-empty, possibly infinite sequences of primitive instructions with finitely 
many distinct suffixes. The instruction sequence represented by a closed term 
of the form t; t’ is the instruction sequence represented by t concatenated 
with the instruction sequence represented by t’.2 The instruction sequence 
represented by a closed term of the form t” is the instruction sequence 
represented by ¢t concatenated infinitely many times with itself. A closed 
PGA term represents a finite instruction sequence if and only if it is a closed 
repetition-free PGA term. 

The axioms of PGA are given in Table 1. In this table, u, uj,..., uz and 
U1,---,Upr41 stand for arbitrary primitive instructions from Z, k, k’, and l 
stand for arbitrary natural numbers from N, and n stands for an arbitrary 
natural number from N,.4 For each n € Nj, the term t”, where t is a PGA 
term, is defined by induction on n as follows: t! = t, and t?+! =t;t”. 

Let ¢ and t’ be closed PGA terms. Then t = ?¢’ is derivable from the 
axioms of PGA iff t and ¢’ represent the same instruction sequence after 
changing all chained jumps into single jumps and making all jumps as short 
as possible. Moreover, t = t’ is derivable from PGA1—PGA4 iff ¢ and ¢’ 
represent the same instruction sequence. We write PGA'* for the algebraic 


3The concatenation of an infinite sequence with a finite or infinite sequence yields the 
former sequence. 
“We write Ni for the set {n € N| n> 1} of positive natural numbers. 
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Table 1: Axioms of PGA 


(X;Y);7=X3(Y;Z) PGA1 
(x")e = xe PGA2 
X¥ Y= xe PGA3 
(X;YY=axs(v; x) PGA4 


#k+1;u,;...; up; #0 = #0; u,;...; un; #0 PGA5 
#k4+13;u1;...;un; #l = #l+k4+13ur;...;un;#l PGAG 


(F:I+k4+15u1;...5 up)” = (#l3ur3...5 ug)” PGA7 
FUAKAR +2 515... 5 Uk (Urge ga)? = 
#l+kK41 5 uy 3...5 Ups (Ur3-. 5 UR ea)” PGA8 


theory whose sorts, constants and operators are those of PGA, but whose 
axioms are PGA1—PGA4. 

The informal explanation of closed PGA terms as sequences of primitive 
instructions given above can be looked upon as a sketch of the intended 
model of the axioms of PGA®°. This model, which is described in detail in, 
for example, [6], is an initial model of the axioms of PGA'S*. Henceforth, 
the instruction sequences of the kind considered in PGA are called PGA 
instruction sequences. 


3 Basic Thread Algebra for Finite and Infinite 
Threads 


In this section, we introduce BTA (Basic Thread Algebra) and an extension 
of BTA that reflects the idea that infinite threads are identical if their 
approximations up to any finite depth are identical. 

BTA is concerned with mathematical objects that model in a direct 
way the behaviours produced by PGA instruction sequences under execution. 
The objects in question are called threads. A thread models a behaviour 
that consists of performing basic actions in a sequential fashion. Upon 
performing a basic action, a reply from an execution environment determines 
how the behaviour proceeds subsequently. The possible replies are the 
Boolean values 0 and 1. 

The basic instructions from A are taken as basic actions. Besides, tau 
is taken as a special basic action. It is assumed that tau ¢ A. We write Atay 
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for AU {tau}. 

BTA has one sort: the sort T of threads. We make this sort explicit to 
anticipate the need for many-sortedness later on. To build terms of sort T, 
BTA has the following constants and operators: 


e the inaction constant D:—- T; 
e the termination constant S:—> T; 


e for each a € Atay, the binary postconditional composition operator 
_dab_:Tx ToT. 


Terms of sort T are built as usual in the one-sorted case. We assume that 
there are infinitely many variables of sort T, including x, y, z. We use infix 
notation for postconditional composition. We introduce basic action prefixing 
as an abbreviation: aot, where a € Atay, and t is a BTA term, abbreviates 
tdabCt. We treat an expression of the form aot and the BTA term that it 
abbreviates as syntactically the same. 

Closed BTA terms are considered to represent threads. The thread 
represented by a closed term of the form t dal?’ models the behaviour 
that first performs a, and then proceeds as the behaviour modeled by the 
thread represented by ¢ if the reply from the execution environment is 1 
and proceeds as the behaviour modeled by the thread represented by t’ if 
the reply from the execution environment is 0. Performing tau, which is 
considered performing an internal action, always leads to the reply 1. The 
thread represented by S models the behaviour that does nothing else but 
terminate and the thread represented by D models the behaviour that is 
inactive, i.e. it performs no more basic actions and it does not terminate. 

BTA has only one axiom. This axiom is given in Table 2. 


Table 2: Axiom of BTA 


xditauby=adtaubs Tl 


Using the abbreviation introduced above, it can also be written as 
follows: x dtaub y = tauoz. 

Each closed BTA term represents a finite thread, i.e. a thread with 
a finite upper bound to the number of basic actions that it can perform. 
Infinite threads, i.e. threads without a finite upper bound to the number 
of basic actions that it can perform, can be defined by means of a set of 
recursion equations (see e.g. [5]). 
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A regular thread is a finite or infinite thread that can be defined by 
means of a finite set of recursion equations. The behaviours produced 
by PGA instruction sequences under execution are exactly the behaviours 
modeled by regular threads. 

Two infinite threads are considered identical if their approximations 
up to any finite depth are identical. The approximation up to depth n of a 
thread models the behaviour that differs from the behaviour modeled by the 
thread in that it will become inactive after it has performed n actions unless 
it would terminate at this point. AIP (Approximation Induction Principle) is 
a conditional equation that formalizes the above-mentioned view on infinite 
threads. In AIP, the approximation up to depth n is phrased in terms of 
the unary projection operator 7, :T —> T. 

The axioms for the projection operators and AIP are given in Table 3. 


Table 3: Axioms for the projection operators and AIP 


to(x) =D PRI 
Tn+1(D) =D PR2 
Tn+41(S) =$ PR3 


Tnti(t Jab y) =m,,(x) Jab a,(y) PR4 


An>o T(x) =Trly) > r=y AIP 


In this table, a stands for an arbitrary basic action from Ajay and n 
stands for an arbitrary natural number from N. We write BTA® for BTA ex- 
tended with the projection operators, the axioms for the projection operators, 
and AIP. 

By AIP, we have to deal in BTA® with conditional equational formulas 
with a countably infinite number of premises. Therefore, infinitary con- 
ditional equational logic is used in deriving equations from the axioms of 
BTA®. A complete inference system for infinitary conditional equational 
logic can be found in, for example, [26]. 

The depth of a finite thread is the maximum number of basic actions 
that the thread can perform before it terminates or becomes inactive. We 
define the function depth that assigns to each closed BTA term the depth of 
the finite thread that it represents: 


depth(t Jat’) = max{depth(t), depth(t’)} +1. 
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4 ‘Thread Extraction and Behavioural Congruence 


In this section, we make precise in the setting of BTA® which behaviours 
are produced by PGA instruction sequences under execution and introduce 
the notion of behavioural congruence on PGA instruction sequences. 

To make precise which behaviours are produced by PGA instruction 
sequences under execution, we introduce an operator |_|. For each closed 
PGA term t, |t| represents the thread that models the behaviour produced 
by the instruction sequence represented by t under execution. 

Formally, we combine PGA with BTA® and extend the combination 
with the thread extraction operator |_|: IS — T and the axioms given in 
Table 4. 


Table 4: Axioms for the thread extraction operator 


a|=aoD TEI #1| = D TE7 
a;X|=a0|X| TE2 |#0;X|=D TES 
+a|=a0D TES |#1;X|=|X| TE9 
+a;X|=|X| dab |#2;X| TE4 #1+2;ul| =D TE10 
—a|=aoD TES #1+2;u;X|=|#l+1;X| TE11 
—a;X|=|#2;X|<al|X| TE6 =S TE12 

Ex Ss TE13 


In this table, a stands for an arbitrary basic instruction from A, u stands 
for an arbitrary primitive instruction from Z, and | stands for an arbitrary 
natural number from N. We write PGA/BTA® for the combination of PGA 
and BTA® extended with the thread extraction operator and the axioms 
for the thread extraction operator. 

If a closed PGA term ¢ represents an instruction sequence that starts 
with an infinite chain of forward jumps, then TE9 and TE11 can be applied 
to |t| infinitely often without ever showing that a basic action is performed. 
In this case, we have to do with inaction and, being consistent with that, 
\t| = D is derivable from the axioms of PGA and TE1—-TE13. By contrast, 
|t| = D is not derivable from the axioms of PGA'* and TE1-TE13. However, 
if closed PGA terms ¢ and ¢’ represent instruction sequences in which no 
infinite chains of forward jumps occur, then t = t’ is derivable from the 
axioms of PGA only if |¢| = |¢’| is derivable from the axioms of PGA'* and 
TE1-TE13. 
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If a closed PGA term ¢ represents an infinite instruction sequence, 
then we can extract the approximations of the thread modeling the be- 
haviour produced by that instruction sequence under execution up to every 
finite depth: for each n € N, there exists a closed BTA term ¢” such that 
Ty(\t|) = t” is derivable from the axioms of PGA, TE1—-TE13, the ax- 
ioms of BTA, and PR1I-PR4. If closed PGA terms ¢ and ¢’ represent 
infinite instruction sequences that produce the same behaviour under ex- 
ecution, then this can be proved using the following instance of AIP: 
An>o Tn (ltl) = r(le']) => [el = |e. 

The following proposition, proved in [6], puts the expressiveness of PGA 
in terms of producible behaviours. 


Proposition 1 Let M be a model of PGA/BTA®™. Then, for each element p 
from the domain associated with the sort T in M, there exists a closed PGA 
term t such that p is the interpretation of |t| in. M iff p is a component of the 
solution of a finite set of recursion equations {V =ty |V € V}, where V is 
a set of variables of sort T and each ty is a BTA term that is not a variable 
and contains only variables from V. 


More results on the expressiveness of PGA can be found in [6]. 

PGA instruction sequences are behaviourally equivalent if they produce 
the same behaviour under execution. Behavioural equivalence is not a con- 
gruence. Instruction sequences are behaviourally congruent if they produce 
the same behaviour irrespective of the way they are entered and the way 
they are left. 

Let t and t’ be closed PGA terms. Then: 


e t and ¢’ are behaviourally equivalent, written t =p. t’, if |t| = |t’| is 
derivable from the axioms of PGA/BTA™; 


e t and t’ are behaviourally congruent, written t =p. t’, if, for each 
LneN, #l;¢;!" =p. #57 5!" 


Behavioural congruence is the largest congruence contained in behavioural 
equivalence. 


5 The Case of Instructions for Turing Tapes 


In this section, we present the instantiation of PGA in which the instructions 
taken as basic instructions are all possible instructions to read out or alter the 


°We use the convention that t; t’° stands for t. 
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content of the cell of a Turing tape under the tape’s head and to optionally 
move the head in either direction by one cell. 

The instructions concerned are meant for Turing tapes of which each cell 
contains a symbol from the input alphabet {0,1} or the symbol u. Turing 
proposed computing machines with a tape of which each cell contains a 
symbol from a finite alphabet, the so-called tape alphabet, that includes the 
input alphabet {0,1} and the symbol ui (see [25]).° The tape alphabet may 
differ from one machine to another. The choice between the tape alphabet 
{0,1,u} and any tape alphabet that includes {0,1,u} is rather arbitrary 
because it has no effect on the computability and the order-of-magnitude 
time complexity of partial functions from ({0, 1}*)” to {0,1}* (n > 0). We 
have chosen for the tape alphabet {0,1,u} because it allows of presenting 
part of the material to come in a more comprehensible manner. 

In the present instantiation of PGA, it is assumed that a fixed but 
arbitrary set F of foci has been given. Foci serve as names of Turing tapes. 

The set of basic instructions used in this instantiation consists of the 
following: 


for each p: {0,1,u} > {0,1}, q: {0,1,u} > {0,1,u}, d € {-1,0, 1}, 
and f € F, a basic Turing-tape instruction f.p/(q,d). 


We write Ait for this set. 

Each basic Turing-tape instruction consists of two parts separated by a 
dot. The part on the left-hand side of the dot plays the role of the name 
of a Turing tape and the part on the right-hand side of the dot plays the 
role of an operation to be carried out on the named Turing tape when the 
instruction is executed. The intuition is basically that carrying out the 
operation concerned produces as a reply O or 1 depending on the content of 
the cell under the head of the named Turing tape, modifies the content of 
this cell, and optionally moves the head in either direction by one cell. More 
precisely, the execution of a basic Turing-tape instruction f.p/(q,d) has the 
following effects: 


e if the content of the cell under the head of the Turing tape named f 
is b when the execution of f.p/(q,d) starts, then the reply produced 
on termination of the execution of f.p/(q, d) is p(b); 


®In many publications in which Turing machines are defined, the input alphabet may 
even be any non-empty finite set of symbols. 
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e if the content of the cell under the head of the Turing tape named f 
is b when the execution of f.p/(q,d) starts, then the content of this 
cell is g(b) when the execution of f.p/(q, d) terminates; 


e if the cell under the head of the Turing tape named f is the ith cell of 
the tape when the execution of f.p/(q, d) starts, then the cell under the 
head of this Turing tape is the max(i + d,1)th cell when the execution 
of f.p/(q, d) terminates. 


The execution of f.p/(q,d) has no effect on Turing tapes other than the one 
named f. 

We write [PGA/BTA™](At:) for PGA/BTA® with A instantiated 
by Ait. Notice that [PGA/BTA™](Ai,) is itself parameterized by a set 
of foci. 

Some functions from {0,1,u} to {0,1,u} are: 


e the function 0?, satisfying 0?(0) = 1 and 0?(1) =0 and 0?(u) = 0; 
e the function 1?, satisfying 1?(0) = 0 and 1?(1) =1 and 1?(u) =0; 
e the function uU?, satisfying u?(0) = 0 and u?(1) = 0 and u?(u) = 1; 
e the function 0, satisfying 0(0) = 0 and 0(1) = 0 and 0(u) = 0; 
e the function 1, satisfying 1(0) = 1 and 1(1) = 1 and O(u) = 1; 
e the function u, satisfying 1(0) =u and 1(1) =u and O(u) =u; 
e the function i, satisfying i(0) = 0 and i(1) =1 and 0(u) =u; 
e the function c, satisfying c(0) = 1 and c(1) =0 and O(u) =u. 


The first five of these functions are also functions from {0,1,u} to {0,1}. 
For some instances of p/(q, d), we introduce a special notation. We write: 


test:0 for 0?/(i,0) , set:0:d for 1/(0,d) , skip:d for 1/(i,d), 
test:1 for 1?/(i,0) , set:l:d for 1/(1,d) , 
test:u for u?/(i,0) , set:u:d for 1/(u,d) , 


where d € {—1,0, 1}. 
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6 Turing-Tape Families 


PGA instruction sequences under execution may interact with the named 
services from a family of services provided by their execution environment. 
In applications of PGA, the services provided by an execution environment 
vary from Boolean registers via Turing tapes to random access memories of 
actual computers.’ In this section, we consider service families in which the 
services are Turing tapes and introduce an algebraic theory of Turing-tape 
families called TTFA (Turing-Tape Family Algebra). 

A Turing-tape state is a pair (7,7), where 7: N, > {0,1,u} andi € Nj, 
satisfying the condition that, for some j € Nj, for all k EN, rj +k) =u. 
We write S for the set of all Turing-tape states. 

Let (7,7) be a Turing-tape state. Then, for all 7 € Ni, 7(j) is the 
content of the jth cell of the Turing tape concerned and the ith cell is the 
cell under its head. 

Our Turing tapes are one-way infinite tapes. Turing proposed comput- 
ing machine with two-way infinite tapes (see [25]). In many publications 
in which Turing machine are defined, Turing machines are a variant of 
Turing’s computing machines with one or more one-way infinite tapes (cf. 
the textbooks [1, 2, 18, 19, 20, 24]). The choice between one-way infinite 
tapes and two-way infinite tapes is rather arbitrary because it has no effect 
on the computability and the order-of-magnitude time complexity of partial 
functions from ({0,1}*)” to {0,1}* (n > 0). We have chosen for one-way 
infinite tapes because it allows of presenting part of the material to come in 
a more comprehensible manner. 

In Section 7, we will use the notation (7:i++b). For each r:N; > {0,1,u}, 
i € Ny, and b € {0,1,u}, (7: i++ 6b) is defined as follows: (7: i> 6)(7) = 
and, for all 7 € N; with j 47, (r:i+ b)(j) = Ty). 

In TTFA, as in [PGA/BTA™](Ait), it is assumed that a fixed but 
arbitrary set F of foci has been given. 

TTFA has one sort: the sort TTF of Turing-tape families. To build 
terms of sort TTF, TTFA has the following constants and operators: 


e the empty Turing-tape family constant 0 :— TTF; 


e for each f € F and s € SU {x}, the singleton Turing-tape family 
constant f.tt(s):— TTF; 


7A Boolean register consists of a single cell that contains a symbol from the alphabet 
{0,1}. Carrying out an operation on a Boolean register produces as a reply 0 or 1, 
depending on the content of the cell, and/or modifies the content of the cell. 
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e the binary Turing-tape family composition operator _ ® _: TTF x 
TTF > TTF; 


e for each F C F, the unary encapsulation operator Op : TTF > TTF. 


We assume that there are infinitely many variables of sort TTF, includ- 
ing u,v,w. We use infix notation for the Turing-tape family composition 
operator. 

The Turing-tape family denoted by @) is the empty Turing-tape family. 
The Turing-tape family denoted by a closed term of the form f.tt(s), where 
s € S, consists of one named Turing tape only, the Turing tape concerned 
is an operative Turing tape named f whose state is s. The Turing-tape 
family denoted by a closed term of the form f.tt(*) consists of one named 
Turing tape only, the Turing tape concerned is an inoperative Turing tape 
named f. The Turing-tape family denoted by a closed term of the form t@ t’ 
consists of all named Turing tapes that belong to either the Turing-tape 
family denoted by t or the Turing-tape family denoted by ¢’. In the case 
where a named Turing tape from the Turing-tape family denoted by t and 
a named Turing tape from the Turing-tape family denoted by t’ have the 
same name, they collapse to an inoperative Turing tape with the name 
concerned. The Turing-tape family denoted by a closed term of the form 
Or(t) consists of all named Turing tapes with a name not in F’ that belong 
to the Turing-tape family denoted by t. 

An inoperative Turing tape can be viewed as a Turing tape whose state 
is unavailable. Carrying out an operation on an inoperative Turing tape is 
impossible. 

The axioms of TTFA are given in Table 5. 


Table 5: Axioms of TTFA 
ueb=u TTFCl ap(0)=0 TTFE1 
u@v=veu TTFC2 dpr(f.tt(s))=0 if fe F TTFE2 
(u@v)@w=ue(vew) TTFC3 Or(f.tt(s)) = ftt(s) if f é F TTFE3 
f.tt(s) ® ftt(s’) = f.tt(x) TTFC4 Or(u@v) =Or(u) GOr(v) TTFE4 


In this table, f stands for an arbitrary focus from *, F stands for an 
arbitrary subset of F, and s and s’ stand for arbitrary members of S U {x}. 
These axioms simply formalize the informal explanation given above. 

The following proposition, proved in [6], is a representation result for 
closed TTFA terms. 
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Proposition 2 For all closed TTFA terms t, for all f € F, either t = 
Ox py (t) is derivable from the axioms of TTFA or there exists an s © SU {x} 
such that t = f.tt(s) ® Or py (t) is derivable from the axioms of TTFA. 


In Section 8, we will use the notation @;_, t;. For each i € Nj, let t; be 
a terms of sort TTF. Then, for each n € Nj, the term @j_, t; is defined by 
induction on 7 as follows: aan t; = t, and am = aaa t; B tn41. We 
use the convention that @j_, t; stands for 0 if n = 0. 


7 Interaction of Threads with Turing Tapes 


If instructions from A;, are taken as basic instructions, a PGA instruction 
sequence under execution may interact with named Turing tapes from a 
family of Turing tapes provided by its execution environment. In line with 
this kind of interaction, a thread may perform a basic action basically for 
the purpose of modifying the content of a named Turing tape or receiving a 
reply value that depends on the content of a named Turing tape. In this 
section, we introduce related operators. 

We combine PGA/BTA™ (Ai) with TTFA and extend the combination 
with the following operators for interaction of threads with Turing tapes: 


e the binary use operator _/_:T x TTF > T; 
e the binary apply operator _e_:T x TTF > TTF; 
e the unary abstraction operator Tray : T > T; 


and the axioms given in Tables 6.8 

In these tables, f stands for an arbitrary focus from ¥, p stands for an 
arbitrary function from {0,1,u} to {0,1}, q stands for an arbitrary function 
from {0,1,u} to {0,1,u}, d stands for an arbitrary member of {—1,0,1}, 7 
stands for an arbitrary function from Nj, to {0,1,u}, 7 stands for an arbitrary 
natural number from Nj, n stands for an arbitrary natural number from N, 
and ¢ and ¢’ stand for arbitrary terms of sort TTF. We use infix notation 
for the use and apply operators. We write [PGA/BTA™](Ai,)/TTI for 
the combination of [PGA/BTA®™](A;;) and TTFA extended with the use 
operator, the apply operator, the abstraction operator, and the axioms for 
these operators. 


8We write t[t’/a] for the result of substituting term t’ for variable x in term t. 
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Table 6: Axioms for the use, apply and abstraction operator 

S/u=S U1 
D/u=D U2 
(tauoxz) /u=tauo (a / wu) U3 
(x Ifp/(adey)/ AH (u) = (@/ AH) Shv/(a ODE (y/ AF (w)) U4 

(x I f-p/(q, 4) © y) / (f-tt(r, t) @ Of py (w)) 
= tauo (x / (f.tt((7 7+ q(7(t))), max(i + d,1)) 6 Or py (u))) if p(r(Z)) = 1 US 

(x I f-p/(q,d) 2 y) / (f-tt(7, 4) © Of 73 (u)) 
=tauo (y/ (f.tt((7 : i+ g(r (t))), max(2 + d,1)) © O¢¢y(u))) if p(7(@)) =0 U6 
(x I f-p/(q, 4) © y) / (f-tt(*) ® Arp} (u)) =D U7 
Tr(a /Uu) =Tn(a) /u U8 
Seu=u Al 
Deu=9 A2 
(tauoz)eu=tauo(reu) A3 
(xIfp/(a.deby)eApy(u) =0 A4 

(x I f-p/(q,d) 2 y) @ (f-tt(r, 4) © Of 7} (u)) 
= xe (f.tt((7T: i+ g(7(2))), max(i + d,1)) © Of} (u)) if p(r(2)) = 1 A5 

(x I f.p/(q, 4) © y) @ (f.tt(r, 7) © Of p(w) 
= ye(f.tt((7 27+ g(7(2))), max(2 + d, 1)) B Og ¢}(u)) if p(r(2)) =0 AG 
(x If.p/(q,d) by) (f.tt(*) A (u)) =0 AT 
Agen tlte(2)/2] = tle (y)/2] > t[x/2] = tly/2] A8 
Ttau(S) = Cl 
Trau(D) = C2 
Trau (tau o — = Ttau (2) C3 
Trau(@ I f.p/(g,4) © y) = Trau(x) J f-p/ (4,4) © Trau(y) C4 
Ax>0o Ttau(™(2)) = Trau(te(y)) => Ttau(@) = Tau (y) C5 


Axioms U1—-U7 and Al—A7 formalize the informal explanation of the 
use operator and the apply operator given below and in addition stipulate 
what is the result of apply if an unavailable focus is involved (A4) and what 
is the result of use and apply if an inoperative Turing tape is involved (U7 
and A7). Axioms U8 and A8 allow of reasoning about infinite threads, and 
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therefore about the behaviour produced by infinite instruction sequences 
under execution, in the context of use and apply, respectively. 

On interaction between a thread and a Turing tape, the thread affects 
the Turing tape and the Turing tape affects the thread. The use operator 
concerns the effects of Turing tapes on threads and the apply operator 
concerns the effects of threads on Turing tapes. The thread denoted by a 
closed term of the form t/t’ and the Turing-tape family denoted by a closed 
term of the form tet’ are the thread and Turing-tape family, respectively, 
that result from carrying out the operation that is part of each basic action 
performed by the thread denoted by ¢ on the Turing tape in the Turing-tape 
family denoted by t/ with the focus that is part of the basic action as its 
name. When the operation that is part of a basic action performed by a 
thread is carried out on a Turing tape, the content of the Turing tape is 
modified according to the operation concerned and the thread is affected as 
follows: the basic action turns into the internal action tau and the two ways 
to proceed reduce to one on the basis of the reply value produced according 
to the operation concerned. 

With the use operator the internal action tau is left as a trace of each 
basic action that has led to carrying out an operation on a Turing tape. The 
abstraction operator serves to abstract fully from such internal activity by 
concealing tau. Axioms C1—C4 formalizes the concealment of tau. Axiom C5 
allows of reasoning about infinite threads in the context of abstraction. 

The following two theorems are elimination results for closed 
[PGA/BTA™](Ai)/TTI terms. 


Theorem 1 For all closed |PGA/BTA®™](Ai,)/TTI terms t of sort T in 
which all subterms of sort IS are repetition-free, there exists a closed 
[PGA/BTA®](Ait) term t’ of sort T such that t = t’ is derivable from 
the axioms of [PGA/BTA™](Att) /TTL 


Proof: It is easy to prove by structural induction that, for all closed 
repetition-free [PGA/BTA®™](Ai;) terms s of sort IS, there exists a closed 
[PGA/BTA®](Att) term s’ of sort T such that |s| = s’ is derivable from 
the axioms of [PGA/BTA®™](Aj,). Therefore, it is sufficient to prove the 
proposition for all closed [PGA/BTA™](A¢¢)/TTI terms t¢ of sort T in 
which no subterms of sort IS occur. This is proved similarly to part (1) of 
Theorem 3.1 from [6]. 


Theorem 2 For all closed [PGA/BTA™](At)/TTI terms t of sort TTF 
in which all subterms of sort IS are repetition-free, there exists a closed 
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[PGA/BTA™](Att) term t! of sort TTF such that t =t' is derivable from 
the axioms of [PGA/BTA®](A¢)/TTI. 


Proof: As in the proof of Theorem 1, it is sufficient to prove the 
proposition for all closed [PGA/BTA™](A¢)/TTI terms t of sort TTF in 
which no subterms of sort IS occur. This is proved similarly to part (2) of 
Theorem 3.1 from [6]. 


8 Computing Partial Functions from ({0,1}*)” to 
{0, 1}° 


In this section, we make precise in the setting of the algebraic theory 
[PGA/BTA™](Ai)/TTI what it means that a given instruction sequence 
computes a given partial function from ({0,1}*)” to {0,1}* (n € N). 

We write Fk, where k € Nj, for the set {t:2 | 1 <i < k} of foci. We 
write [[PGA/BTA™](Ait)/TTI](F*) for [PGA/BTA®](Ay,)/TTI with F 
instantiated by FX. 

Below, we use the function c:{7:N; — {0,1,u} | (7,1) © S} — {0,1,u}* 
for extracting the content of a Turing tape. This function is defined as follows: 


for all n € Nj, for all by,...,b, € {0,1,u}, 
C7) bi >.< bp, ME F(t) = b for all Vm, 72) =u tor all ¢> 4, and 


T(n) #3 
or) =e iff r(¢) =u for alli > 1.9 


Let k € Nj, let t be a closed [[PGA/BTA™](Att)/TTI](F) term of 
sort IS, let n EN, let F': ({0,1}*)” + {0,1}*,!° and let T: NN. Then ¢ 
computes F with k tapes in time T if: 


e for all wy,...,Wn € {0,1}* such that F'(w1,..., wn») is defined, 
there exist (7), 11), ---,(7,_1,t%-1) € S such that 


[él © pay try-tt(7y, 1) = QHay ty-tt(7), 45) ® tek.tt(rf, 1) , 
depth(|t| / Ory toy.tt(ry, 1)) < TP len(an) len wy) , 


where 


°We write € for the empty string. 
We write f: A-+ B to indicate that f is a partial function from A to B. 
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7 is the unique 7: N, — {0,1,u} with (7,1) € S and 


C(t San) ties Lia, 
for 7 £ 1, 7; is the unique 7: N; > {0,1,u} with (7,1) € S and 
c(T) =, 


7, is the unique 7:N, — {0,1,u} with (7,1) € S and 
Clr) =F ig acta): 


e for all wy,...,Wn € {0,1}* such that F'(w1,..., wp) is undefined, 
lt] © BF_, tey-tt(7),1) =9, 
where 


7 is the unique 7: N, — {0,1,u} with (7,1) € S and 


CT) SW Lew, 
for j £1, 7; is the unique 7: Nj > {0,1,u} with (7,1) € S and 
ela) =. 


We say that t computes F in time T if there exists a k € Ny, such that t 
computes F’ with k tapes in time 7’, and we say that t computes F if there 
exists a I’: N > N such that t computes F in time T. 

With the above definition, we can establish whether an instruction 
sequence of the kind considered in [[PGA/BTA®™](Ait) /TTI](F£) (k € Ni) 
computes a given partial function from ({0,1}*)” to {0,1}* (n € N) by 
equational reasoning using the axioms of [[PGA/BTA®™] (Att) /TTI] (FX). 

A single-tape Turing-machine program is a closed 
[[PGA/BTA®](Ait)/TTI](Fi) term of sort IS that is of the form 
(ti ;...3 tn)”, where each t; is of the form 


test:0 ; #3 ; set:bo:do ; uo ; 
test:1 ; #3 ; set:by:d, 3 uy; 
test:u ; #3; set:by:d,; uy, 
where bo, 61, 0, € {0,1,u}, do, di,d, € {-1,0,1}, and 
uo is of the form #l with 1 € {12-1 +9|0<i<n} or #0 or}, 


uz is of the form #l with 1 € {12-1 +5|0<i <n} or #0 or}, 


uu is of the form #1 with | € {12-1+1|0<i<n} or #0 or!. 
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We refrain from defining a k-tape Turing-machine program (for k > 1), 
which is much more involved than defining a single-tape Turing-machine 
program. However, we remark that the theorems given below go through for 
k-tape Turing-machine programs. 

The following theorem is a result concerning the computational power 
of single-tape Turing-machine programs. 


Theorem 3 For each F : ({0,1}*)" + {0,1}*, there exists a single-tape 
Turing-machine program t such that t computes F iff F is Turing-computable. 


Proof: For each F': ({0, 1}*)” + {0,1}*, F is Turing-computable iff there 
exists a Turing machine with a single semi-infinite tape and stay option that 
computes F’. There is an obvious one-to-one correspondence between the 
transition functions of such Turing machines and single-tape Turing-machine 
programs by which the Turing machines concerned can be simulated when 
they are applied to a single tape. Hence, for each F': ({0,1}*)” + {0,1}*, 
there exists a single-tape Turing-machine program t such that t computes F' 
iff F' is Turing-computable. 


Below, we write TMP, for the set of all single-tape Turing-machine 
programs, and POLY for {T | T:N > NA Tisapolynomial function}. 

The following theorem is a result relating the complexity class P to the 
functions that can be computed by a single-tape Turing-machine program 
in polynomial time. 


Theorem 4 P is equal to the class of all functions F': {0,1}* — {0,1} for 
which there exist ant € TMP. and aT € POLY such that t computes F 
in time T. 


Proof: This follows from the proof of Theorem 3 and the fact that, if a 
function F’: {0,1}* + {0,1} is computed on a Turing machine in time T, 
then the one-to-one correspondence referred to in the proof of Theorem 3 
yields for this Turing machine a single-tape Turing-machine program that 
computes F' in a time of O(T). 


We think that Theorems 3 and 4 above provide evidence of the claim that 
[PGA/BTA™](Att)/TTI is a suitable setting for the development of theory 
in areas such as computability and computational complexity. Moreover, in 
this setting variations on Turing machines that have not attracted attention 
yet come into the picture and can be studied. 
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9 A Turing-Machine Program Example 


In this section, we give a simple example of a Turing-machine program. We 
consider the non-zeroness test function NZT :({0,1}*)' > {0,1}* defined by 


NZT(b, ... bn) =0 if b} =Oand ... andb, =0, 
NAO 2.2 by Wd bp a Or 225: Or by = 1 


NZT models the function nzt:N + N defined by nzt(0) = 0 and nzt(k+1) =1 
with respect to the binary representations of the natural numbers. 


We define a Turing-machine program NZTIS that computes NZT 
as follows: 


NZTIS & (—test:0 ; #3 ; set:0:1 ; #33; 
—test:1 ; #3 ; set:1:1 ; #29; 
—test:u ; #3; set:u:—1; #1; 
—test:0 ; #3 ; set:u:—1 ; #33; 
—test:1 ; #3; set:u:—1; #5; 
—test:u ; #3 ; set:0:0;!; 
—test:0 ; #3 ; set:u:—1 ; #33; 
—test:1 ; #3; set:u:—1; #29; 
—test:u ; #3; set:1:0;!)”. 


First, the head is moved to the right cell-by-cell until the first cell whose 
content is u has been reached and after that the head is moved to the left 
by one cell. Then, the head is moved to the left cell-by-cell until the first 
cell of the tape has been reached and on top of that the content of each cell 
that comes under the head is replaced by u. Finally, the content of the first 
cell is replaced by 1 if at least one cell with content 1 has been encountered 
during the moves to the left and the content of the first cell is replaced by 0 
if no cell with content 1 has been encountered during the moves to the left. 

Because Turing-machine programs closely resemble the transition func- 
tions of Turing machines, they have built-in inefficiencies. We use NZTIS 
to illustrate this. We define an instruction sequence NZTIS’ that computes 
NZT according to the same algorithm in less time than NZTIS as follows: 
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NZTIS' © (+test:u ; #3 ; skip:1 ; #18 ; 
skip:—1 ; 
—test:0 ; #3; set:u:—1; #18; 
—test:1 ; #3; set:u:—1; #3; 


set:0:0 ;!; 
+test:u ; #3; set:u:—1 ; #18; 
set:1:0;!)” . 


In NZTIS', which is clearly not a single-tape Turing-machine program, all 
instructions of the form test:b that are redundant or can be made redundant 
by using instructions of the form skip:d are removed. 

In [13], we have presented instruction sequences that compute the 
restriction of NZT to {0,1}”, for n > 0. The instruction sequences concerned 
are instruction sequences that, under execution, can act on Boolean registers 
instead of Turing tapes. 


10 Concluding Remarks 


We have presented an instantiation of a parameterized algebraic theory of 
single-pass instruction sequences, the behaviours produced by such instruc- 
tion sequences under execution, and the interaction between such behaviours 
and components of an execution environment. The parameterized theory 
concerned is the basis of a line of research in which issues relating to a wide 
variety of subjects from computer science have been rigorously investigated 
thinking in terms of instruction sequences (see [21]). In the presented in- 
stantiation of this parameterized theory, all possible instructions to read 
out or alter the content of the cell of a Turing tape under the tape’s head 
and to optionally move the head in either direction by one cell are taken 
as basic instructions and Turing tapes are taken as the components of an 
execution environment. 

The instantiated theory provides a setting for the development of 
theory in areas such as computability and computational complexity that 
distinguishes itself by offering the possibility of equational reasoning and 
being more general than the setting provided by a known version of the 
Turing-machine model of computation. Many known and unknown versions 
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of the Turing-machine model of computation can be dealt with by imposing 
apposite restrictions. 


We have defined the notion of a single-tape Turing-machine program 
in the setting of the instantiated theory and have provided evidence for 
the claim that the theory provides a suitable setting for the development 
of theory in areas such as computability and computational complexity. 
Single-tape Turing-machine programs and multiple-tape Turing-machine 
programs make up only small parts of the instruction sequences that can 
be considered in this setting. This largely explains why it is more general 
than the setting provided by a known version of the Turing-machine model 
of computation. From our experience in previous work with a comparable 
algebraic theory of instruction sequences, with instructions that operate on 
Boolean registers instead of Turing tapes, we expect that the generality is 
conducive to the investigation of novel issues in areas such as computability 
and computational complexity. 


The given presentation of the instantiated theory is set up in a way 
where the introduction of services, the generic kind of execution-environment 
components from the parameterized theory in question, is circumvented. 
In [12], the presentation of another instantiation of the same parameterized 
theory, with instructions that operate on Boolean registers, is set up in 
the same way. The distinguishing feature of this way of presenting an 
instantiation of this parameterized theory is that it yields a less involved 
presentation than the way adopted in earlier work based on an instantiation 
of this parameterized theory. 


The closed terms of the instantiated theory that are of sort IS can be 
considered to constitute a programming language of which the syntax and 
semantics is defined following an algebraic approach. This approach is more 
operational than the usual algebraic approach which is among other things 
followed in [14, 15, 17]. A more operational approach is needed to make it 
possible to investigate issues in the area of computational complexity. 


Broadly speaking, the work presented in this paper is concerned with 
formalization in subject areas, such as computability and computational 
complexity, that traditionally relies on a version of the Turing-machine model 
of computation. To the best of our knowledge, very little work has been 
done in this area. Three notable exceptions are [3, 22, 28]. However, in 
those papers, formalization means formalization in a theorem prover (Matita, 
HOL4, Isabelle/HOL). Little or nothing is said in these papers about the 
syntax and semantics of the notations used — which are probably the ones 
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that have to be used in the theorem provers. This makes it impracticable 
to compare the work presented in those papers with our work, but it is of 
course clear that the approach followed in the work presented in those papers 
is completely different from the algebraic approach followed in our work. 
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